Parallel Performance Evaluation of Sequence Nucleotide Alignment on the Supercomputer BlueGene/P

نویسندگان

  • PLAMENKA BOROVSKA
  • VESKA GANCHEVA
  • STOYAN MARKOV
چکیده

Bioinformatics is a scientific area requiring powerful computing resources for exploring large sets of biological data. Sequence alignment is an important method in DNA and protein analysis. BLAST has become the most popular tool and implements a fast heuristic method for sequence alignment and searching. The goal of this paper is to estimate the scalability of parallel sequence alignment on the supercomputer BlueGene/P for the case study of investigating the interaction between influenza virus A and the host genome. Parallel performance evaluation of sequence alignment have been performed experimentally on the basis of parallel mpiBlast program implementation and conducted on a local mirror database comprising the available isolates of the influenza virus A and the human genome. The molecular biology outcome of the experiments is that the similarity of influenza virus A and human genome have been determined. Key-Words: Biocomputing, High Performance Computing, Human Genome, Influenza Virus, mpiBLAST, Parallel Performance, Sequences Alignment.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scaling of Parallel Software for Biological Sequences Alignment and Homology Search on the Supercomputer BlueGene/P

The goal of this paper is to propose the performance evaluation of the scaling of parallel software for biological sequence alignment and homology searching based on blast algorithm for sequence searching and clustalw algorithm for multiple sequence alignment on the supercomputer BlueGene/P for the case study of influenza virus sequences variability and homology searching with human genome.

متن کامل

Computational Challenges in Biological Sequence Processing and In-silico Molecular Biology Experiments

Biological sequence processing is a key of information technology for molecular biology. This scientific area requires powerful computing resources for exploring large sets of biological data. The huge amount of biological sequences accumulated in the world nucleotide and protein databases requires efficient parallel tools for structural genomic and functional analysis. The paper describes the ...

متن کامل

Computational Aspects of In-silico Experiments for Investigating the Impact of the Host Genome on the Influenza Virus A Variability

Nowadays the study of the variability of influenza virus is a problem of very great importance. Influenza type A viruses cause epidemics and pandemics. The problem of restricting the spreading of pandemics and the treatment of the people infected by the influenza virus is widely based on the latest achievements of molecular biology, bioinformatics and biocomputing, as well as many other advance...

متن کامل

Optimization of Multiple Sequence Alignment Software ClustalW

* Corresponding author. E-mail address: [email protected] ‡ Corresponding author. E-mail address: [email protected] † Corresponding author. E-mail address: [email protected] Abstract This activity with the project PRACE-2IP is aimed to investigate and improve the performance of multiple sequence alignment software ClustalW on the supercomputer BlueGene/Q, so-called JUQUEEN, for the case study o...

متن کامل

Available online at www.prace-ri.eu Partnership for Advanced Computing in Europe

In silico biological sequence processing is a key task in molecular biology. This scientific area requires powerful computing resources for exploring large sets of biological data. Parallel in silico simulations based on methods and algorithms for analysis of biological data using high-performance distributed computing is essential for accelerating the research and reducing the investment. Mult...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011